SOMLib: A Distributed Digital Library System based on Self-Organizing Maps
نویسنده
چکیده
We describe an architecture for a distributed digital library system based on an unsupervised neural network model, namely the Self-Organizing Map. The system allows the clustering of text documents forming the basis for intelligent information retrieval. User prooles can be combined with full text queries or sample texts to locate documents within the library system. Contrary to conventional approaches consisting of one central library, the distributed SOMLib allows the combination and integration of several independent SOM-based library systems in a not necessarily hierarchical order. Furthermore, the system is not limited to a speciic type or setup of the basic SOM architecture. Rather, variations of the basic architecture which might serve special needs of some users can be combined.
منابع مشابه
The SOMLib Digital Library System
Digital Libraries have gained tremendous interest with several research projects addressing the wealth of challenges in this eld. While computational intelligence systems are being used for speciic tasks in this arena, the majority of projects relies on conventional techniques for the basic structure of the library itself. With the SOMLib project we created a digital library system that uses a ...
متن کاملAdding SOMLib Capabilities to the Greenstone Digital Library System
Many conventional digital library systems offer access to their collections only via full text or meta-data search, or by browsingaccess via a hierarchy of categories. With the increasing amount of digital content available, alternative methods to access the content seem necessary. The SOMLib system, which is based on using Self-Organizing Maps (SOMs), has been used to automatically organize do...
متن کاملCreating an Order in Distributed Digital Libraries by Integrating Independent Self-Organizing Maps
Digital document libraries are an almost perfect application arena for un-supervised neural networks. This because many of the operations computers have to perform on text documents are classiication tasks based on \noisy" input patterns. The \noise" arises because of the known inaccuracy of mapping natural language to an indexing vocabulary representing the contents of the documents. A growing...
متن کاملUser Interfaces for Digital Libraries
Digital Libraries have gained tremendous interest with several research projects addressing the wealth of challenges in this field. While computational intelligence systems are being used for specific tasks in this arena, the majority of projects relies on conventional techniques for the basic structure of the library itself. With the SOMLib project we created a digital library system that uses...
متن کاملTowards Automatic Content-based Organization of Multilingual Digital Libraries: an English, French, and German View of the Russian Information Agency Novosti News
In this paper we present the application of the SOMLib digital library system to a multilingual document corpus from the Russian Information Agency Novosti. News articles in Russian, English, and German are automatically organized into separate topic hierarchies using a novel unsupervised neural network, namely the Growing Hierarchical Self-Organizing Map. Furthermore, machine translation is us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998